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Description 

Background of the Invention 

5 This invention relates to a coagulation tnhitMtor known as tissue factor inhibitor (TFI) and alternatively as 
lipoprotein associated coagulation inhibitor <LACI). More particularly, the invention relates to a cDNA clone 
representing essentially the full size TFI. 

The coagulation cascade that occurs in mammalian blood comprises two distinct systems - the so- 
called intrinsic and extrinsic systems. The latter system is activated by exposure of blood to tissue 

10 thromboplastin (Factor III), hereinafter referred to as tissue factor (TF). Tissue factor is a lipoprotein that 
arises In the plasma membrane of many cell types and in which the brain and lung are particularly rich. 
Upon coming into contact with TF. plasma Factor VII or its activated form. Factor Vll,. forms a calcium- 
dependent complex with TF and then proteolytically activates Factor X to Factor Xb. and Factor IX to Factor 
IXo. 

15 Early studies concerning the regulation of TF-initiated coagulation showed that incubation of TF (in 
crude tissue thromboplastin preparations) with serum inhibited its activity in vitro and prevented its lethal 
effect when it was infused into mice. Extensive studies by Hjort, Scand. J. Clin. Lab. Invest. 9 , Suppl. 27. 
76-97 (1957), confirmed and extended previous work in the area, and led to the conclusion that an inhibitory 
moiety In serum recognized the Factor VII-TF complex. Consistent with this hypothesis are the facts that 

20 the inhibition of TF that occurs in plasma requires the presence of Ca^ (which is also necessary for the 
binding of Factor Vll/Vlla to TF) and that inhibition can be prevented arKl/or reversed by chelation of divalent 
cations with EDTA. More recent investigations have shown that not only Factor Vll^ but also catalytically 
active Factor Xa and an additional factor are required for the generation of TF inhibition in plasma or serum.* 
See Broze and Miletich. Blood 69 . 150-155 (1987). and Sanders et al.. Ibid., 66. 204-212 (1985). This 

25 additional factor, defined herein as tissue factor inhibitor (TFI), and altemativaly as lipoprotein associated 
coagulation inhibitor (LACI). is present in barium-absort>ed plasnta and appears to be associated with 
lipoproteins, since TFI functional activity segregates witti the lipoprotein fraction that floats when serum is 
centrifuged at a density of 1.21 g/cm^. According to Broze and Miletich. supra , and Proc. Natl. Acad. Sd. 
USA 84 . 1886-1890 (1987). HepG2 cells (a human hepatoma cell line) secrete an inhibitory moiety with the 

30 same characteristics as the TFI present in plasma. 

In copending application US Ser. No. 77.366. filed July 23. l987.(priority document of EP 0 300 988), a 
purffied tissue factor inhibitor (TFI) is disclosed which was secreted from HepG2 cells. It was found to exist 
in two forms, a TPh. migrating at about 37-40.000 daltons and a TFb at at>out 25-26,000 daftons. as 
determined by sodium dodecylsuHate polyacrylamide gel electrophoresis (SOJSdPAGE). A partial N-terminal 

35 amino acid sequence for the TFI was assigned as: 



X-X-Glu-Glu-Asp-Glu-Glu-His-Thr-Ile-Ile-Thr-Asp-Thr-Glu- 

40 ■ . 

16 27 
Leu-Pro*Pro-Leu-Iys-Leu-Met-His-Ser-Phe-(Phe)*Ala 

45 wherein X-X had not been determined. The disclosure of said application is incorporated henmn by 
reference. 

Brief Description of the Inverrtion 

50 In accordance with the present invention, the complete codir>g sequence of a cDNA clone representing 
essentially the full size tissue factor inhibitor (TFI) has been developed. 

Initially, human placental and fetal liver Xgtll cDNA libraries were screened with a rabbit polyclonal 
antiserum raised against a purified TFI. Immunologically positive clones were further screened for '^sj. 
Factor Xa binding activity. Seven clones were obtained which were immunologically and functionally active. 

55 The longest clone, placental-derived XP9. was 1.4 kitobases (kb) tong while tii other six wer 1.0 kb in 
length. Partial DNA sequencing showed the 1 .0 kb clones to have sequences klentical to part of the tonger 
1.4 kb clone. Nucleotide sequence analysis showed ttiat XP9 consisted of a 1432 basepair (bp) cDNA insert 
ttiat includes a 5'-noncoding region of 133 bp. an open reading frame of 912 bp, a stop codon. and a y- 
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noncodtng region of 384 bp. 

The cDNA sequenc encodes a 31,950 Datton protein of 276 amino acids which Includes 18 cysteines 
and 7 methionines. The translated amino acid sequence shows that a signal peptid of at)0ut 28 amino 
acids precedes the mature TFI protein. It will be und rstood that the "mature* TFI is defined to include both 
5 TFI and m thtonyl TFI by virtue of the ATG translational codon in th XP9 clone described herein. 

There are . three potential N-linked glycosylation sites In the TFI protein with the sequence Asn-X- 
Ser/Thr. wherein X can be any of the common 20 amino acids. These sites are at amino acid positions Asn 
145. Asn 195. and Asn 256. when the first methionine after the 5'-noncodlng region is assigned amino acid 
position + 1 . 

10 The translated amino acid sequence of TFI shows several discernible domains, including a highly 
negatively charged N-termtnal. a highly positively charged carboxy-terminal. and an intervening portion 
consisting of 3 homologous domains with sequences typical of Kunrtz-type enzyme inhibitors. Based on a 
homology study. TFI appears to be a member of the basic protease inhibitor gene superfamily. 

The original source of the protein material for developing the cDNA clone XP9 was human placental 

15 tissue. Such tissue is widely available after delivery by conventional surgical procedures. The xgtll (lac5 
ninS C1857 SI 00) used herein is a welHmown and commonly available lambda phage expression vector. Its 
construction and restriction endonuclease map is described by Young and Davis. Proc. Natl. Acad. Sci. USA 
§3.1194-1198(1983). 

Northern blot analysis showed that the following liver-derived cell lines: 

20 Chang liver. HepG2 hepatoma, and SK-HEP-1 hepatoma, all contained 2 major species of mRNA (1.4 and 
4.4 kb) which hybridized with the TFI cDNA. 

The ctoning of the cDNA for TFI and development of its entire protein sequeru» and structural domains 
as disclosed herein permits detailed structure-functional analyses and provides a foundation for study of its 
biosynthetic regulations. The invention thus is important to medical science in the study of the coagulation 

25 cascade with respect to agents which are able to inhibit Factor X» and Factor VHaHT enzymatic complex. 

Detailed Description of the Invention 

While the specification concludes with claims partcularly pointing out and distinctly claiming the subject 
30 matter regarded as forming the present invention, it is believed that the invention will t>e better understood 
from the following detailed description of preferred embodiments of the invention taken in conjunction with 
the appended drawings, in which: 

RG. 1 shows the screening of Xgtll clones with ^^l-Factor X^. Cloned phage lysates (0.1 ml) were 
• spotted on a nitrocellutose paper by suction using a dot blot apparatus. The nitrocellulose paper was 
35 then probed with ^^l-Factor X* and autoradiographed as described hereinafter. The clones that appear 
as dark spots are positive ctones ttiat bind ^^l-Factor (^ntrol xgtll (tower right comer) and ottier 
clones do not bind -Factor Xb- 

FIG. 2 shows a partial restriction map and sequencing strategy for the XP9 inserts. The scale at the 
bottom indicates the nucleotide position. The thick bar represents the coding region. The thin bars 
40 represent 5'- and 3'-noncoding regions. The restriction endonuclease sites were confirmed by digestion. 
The arrows show the overlapping Ml 3 clones used to sequence the cDNA. 

FIG. 3 shows the nucleotide sequence and translated amino acid sequence of the human TFI cDNA. 

Nucleotides are numbered on the left and amino acids on the right The urKieriined sequences have 

been independentiy confirmed by amino acid sequence analysis of the purified TFI protein and two Vs 
45 protease trypsin digested peptides. Amino acid 1 was assigned to the first methionine after a stop 

codon of the 5'>noncoding region. Potential N-tined glycosylation sites are marked by asterisks. 

FIG. 4 is a graphical representation which shows the charge distribution of the amino acid sequence in 

TFI. Charges are calculated from the first resMue to the i-th residues and displayed at the i-th residue. 

Thus the value of the i-th position is the summation of all charges from the first reskHue to the i-th 
so residue and the difference of the charges between the i-th and j-th residue (j>i) is the net charge of the 

fragment from i-th to j-th residue. 

FIG. 5 is a graphical representation which shows the hydrophobiclty profile of TR. The hydrophobidty 
profile was analyzed by a computer program whereby the hydrophobiclty index of the amino acid 
residues is defined as the depth to which an amino acid residue is buried inside a protein (from X-ray 
55 crystallographic data) (Kidera et al., J. Protein Chem. 4 , 23-55 (1985)]. The hydrophobiclty profil atong 
the sequence was smoothed using the program ICSSCU in IMSL Library [IMSL Library Reference 
Manual, 9th ed.. Institute for Mathematical and Statistical Subroutine Library, Houston. Texas (1982)]. 
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FIG. 6 shows an aligWment of th basic protease inhibitor domains of TR with other basic protease 
inhibitors. All the sequences except TFI were obtained from the National Biomedical Research Founda- 
tion Protein Sequence Database (Georgetown University. Washington. D.C.. release 13. Jun 1d87). 1. 
Bovine basic proteas inhibitor precursor; 2. Bovine colostrum trypsin inhibitor; 3. Bovine serum basic 

5 protease inhibitor; 4. Edible snail isoinhibitor K; 5. Red s a turtle basic protease inhibitor (only amino 
acids 1-79 presented); 6. Western sand viper venom basic protease inhibitor I: 7. Ringhals venom basic 
protease inhibitor 11; 8. Cape cobra venom basic protease inhibitor II; 9. Russell's viper venom basic 
protease inhibitor II; 10. Sand viper venom t>asic protease inhibitor III; 11. Eastern green mamba venom 
basic protease inhibitor I homolog; 12. Black mamba venom basic protease inhibitor B; 13. Black mamba 

10 venom basic protease inhibitor E; 14. Black mamba venom basic protease inhibitor I; 15. Black mamba a 
venom basic protease inhibitor K; 16. ^-1-Bungarotoxin B chain (minor): 17. /9*1-Bungarotoxin B chain 
(major); 18. ^-2-Bungarotoxin B chain; 19. Horse inter-a-trypsin inhibitor [amino adds 1*57(1): 58-123 (2)- 
); 20. Pig inter-ot-trypsin inhibitor [amino acids 1-57(1): 58-123(2)); 21. Bovine inter-o-trypsin inhibitor 
[amino acids 1-57(1); 58*123(2)]: 22. Human a-1-micrbglobulin/inter-a-trypsin inhibitor precursor [amino 

)5 adds 227-283(1): 284-352(2)]; 23. TFI [amino adds 47-117(1); 118-188(2); 210-280(3)]. Gaps were 
included in 16. 17. 18 to achieve best alignment. Standard one letter codes for amino acids are used. 
FIG. 7 shows the Northern blot analysis of RNAS from 3 liver-derived cell lines. Ten ug of polyCA)*^ RNA 
were used per lane. Lane 1 . Chang liver cell; lane 2. SK-HEP-1 hepatoma cell; larte 3. HepG2 hepatoma 
cell. 

20 Standard biochemical nomenclature is used herein in which the nucleotide bases are designated as 
adenine (A); thymine <T); guanine (G); and cytosine (C). Corresponding nucleotides are. for example. 
deoxyguanostne-5'-triphosphate (dGTP). As is conventional for convenience in the structural representation 
of a DNA nucleotide sequence, only one strand is shown in which A on one strand connotes T on its 
complement and G connotes C. Amino acids are shown either by three letter or one letter abbreviations as 

25 follows: 



Abbreviated Designation 


Amino Acid 


A 


Ala 


Alanine 


C 


Cys 


Cysteine 


D 


Asp 


Aspartic ackS 


E 


Glu 


Glutamic add 


F 


Phe 


Phenylalanine 


G 


Gly 


Glycine 


H 


His 


Hlsttdine 


1 


lie 


Isoleucine 


K 


Lys 


Lysine 


L 


Leu 


Leucine 


M 


Met 


Methionine 


N 


Asn 


Asparagine 


P 


Pro 


Proline 


Q 


Gin 


Glutamine 


R 


Arg 


Arginine 


S 


Ser 


Serine 


T 


Thr 


Threonine 


V 


Val 


Valine 


W 


Trp 


Tryptophan 


Y 


Tyr 


Tyrosine 



Commonly available restriction endonucteases described herein have the following restriction se- 
quences and (indicated by arrows) cleavage patterns: 
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70 



75 



20 



EcoRl 



Sspl 



Clal 



Alul 



Stul 



i 

GAATTC 
CTTAAG 
f 

TTATAA 

AI^GAT 
TAGCJPA 

TCGA 
t 

AGGCCT 
TCCGGA 
f 



In order to illustrate specific preferred embodiments of the invention in greater detail, the following 
exemplary laboratoiy preparative work was carried out. 



Example 1 



Materials 

30 Human placental and fetal liver cDNA iit)raries were obtained from Clonetech. The protoblot im- 
munoscreening kit was purchased from Promega Btotech. Restriction enzymes were from New England 
Biolabs. CaH intestine alkaline phosphatase. T4 ONA tigase, DNA polymerase I (Klenow). exonuctease 111 
and SI nuclease were from Boehringer Mannheim. dNTPs were from P. L Biochemicals. $ j-thlo- 

dATP (600 Cl/mmol) was from Amersham. Sequencing kit (Sequenase) was from United States Biochem- 

35 icals. Chang liver cells (ATCC CCL 13) and HepG2 hepatoma cells (ATCC HB 8065) were obtained from the 
American Type Culture Collection. SK-HEP-1 hepatoma cells were originally derived from a liver adenocar- 
cinoma by G. Trempe of Sioan-Kettering Institute for Cancer Research in 1971 and are now widely and 
readily available. 

^^I-Factor Xa was prepared by radiolabeling using lodo-gen. The specific activity was 2000 dpm/ng. 

40 Greater than 97% of radioactivity was precipitable with 10% trichloroacetic acid (TCA). The iodinated 
protein retained >80% of their catalytic activity toward Spectrozyme Xa (American Dtagnostica product). 

An anti-TFI-lg Sepharose® 4B column was prepared as foIk>ws: A peptide (called TFhpeptide) 
containing a sequence corresponding to the amino acid sequence d-25 of the mature TFl was synthesized 
using Biosystem*s solid phase peptide synthesis system. The TFI-peptide (5 mg) was conjugated to 10 mg 

45 of Keyhole lympet hemocyanin by glutaraMehyde. Two New Zealand white rabbits were each immunized 
by intradermal injection with a homogenate containing 1 ml of Freund complete adjuvant and 1 ml of 
conjugate (200 ug of TFI-peptide). One month later the rabbits were each boosted with a homogenate 
containing 1 ml of Freund incomplete adjuvant and 1 ml of conjugate (100 ug of conjugate). Antiserum was 
collected each week for 3 months and booster injections were performed monthly. To isolate specific 

50 antit>ody against TFI-peptide, the antiserum was chromatographed on a TFI-peptide Sepharose 4B column. 
The column was washed with 10 volumes of PBS (0.4 M NaCI-0.1 M benzamidine-1% Triton® X-100) and 
the same solution without Triton X-100, The antibody was eluted with 0.1 M glycine/HCI, pH 2.2, 
immediately neutralized by adding 1/10 volume of 1 M Tris-OH and dialyzed against saline solution. The 
isolated antibody was coupled to cyanogen bromide activated Sepharose 4B by the manufacturer's 

55 (Pharmacia) method and used to isolate TR from the c II cultur medium. 

Chang liver cell was cultured by the method described previously by Broze and Mil tich. Proc. Natl. 
Acad. Sci. USA 84 . 1886-1890 (1987). The conditioned medium was chromatographed on th anti-TFUg 
Sepharose 4B column. The column was washed with 10 volumes of PBS-1% Triton X-100 and PBS. Th 
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bound TFI was eluted with 0.1 M glycine/HCI. pH 2.2. The immunoaffinlty isolated TR was further puriOed 
by pr parative sodium dod cylsutfate poiyacryiamid gel lectrophoresis (Savant apparatus). Amino acid 
analysis of the final product showed the same amino terminal sequenc as the TFI isolated from H pG2 
cells as described in copending application. US Ser. No. 77.366. filed July 23, 1987. (priority document of 
s EP 0 300 988). The isolated Chang liv r TFI was then used to immunize rabbits by the immunization 
protocol described above. The antiserum obtained had a titer of about 100 ug/ml in the double immunodif- 
fusion test. This antiserum was used in the immuno-screening of Xgtl 1 cDNA libraries. 

Methods 

10 

Isolation of cDNA clones. Metf>ods for screening the placental and fetal liver cDNA libraries with 
antibody, plaque purification, and preparation of X-phage lysate and DNA were as described by Wun and 
Kretzmer, FEBS Lett 1 . 11-16 (1987). The antiserum was pre-adsorbed with BNN97 Xgtll lysate and 
diluted 1/500 for screening the lit>rary. 

15 

Screening of factor X, binding activity 

Recombinant proteins induced by isopropyl-^-thtogalactoside from Immuno-positive x-phage Isolates or 
from control xgtll were screened for Factor Xa binding activity. The X-phage lysates (0.1 ml) were filtered 

20 through a nitrocellulose paper using a dot-blot apparatus (Bio Bad). The nitrocellulose paper was then 
immersed and agitated n a phosphate buffered saline containing 5 mg/ml t>ovine serum albumin and 2.5 
mg/ml bovine gamma globulin at room temperature for 1 h. The solution was replaced with ^^i-F&cXor X. - 
(1.0 X 10^ cmp/ml) dissoh/ed in the same solution supplemented with 0.1 mg/ml heparin and the agitation 
continued for another hour. The nitrocellulose paper was then washed with phosphate buffered saline 

25 containing 0.05% Tween® 20. The washing buffer was changed every 5 min.. 4 times. The nitrocellulose 
paper was then air-dryed and prepared for autoradiography using Kodak XR5 film. The film was developed 
after 1 week exposure. 

Preparation of poly(A)^ RNA and Northern blotting. Total RNAs were prepared from cultured Chang liver 
cell, HepG2 hepatoma cell and SK-HEP-1 hepatoma celt using the sodium perchtorate extraction method of 

30 Lizardi. and Engelberg. Anal. Biochem. 98 . 116-122. (1979). Poly(A)* RNAs were isolated by batch-wise 
adsorption on ollgo(dT)-cetluk)se (P-L Biochemical, type 77F) using the procedure recommended by the 
manufacturer. For Northern blot analysis, 10 ug each of poly(A)^ RNA was treated with glyoxal [Thonrias. 
Methods Enzymol. 100 , 255-266 (1983)] and subjected to agarose gel electrophoresis in a buffer containing 
10 ml sodium phosphate, pH 7.0. Bethesda Research Laboratory's RNA ladder was used as a molecular 

35 weight marker. The RNAs were transblotted onto a nitrocellulose paper which was then baked at 80 * for 2 
h. The insert DNA of XP9 clone was radiolabeled with by nick translation and used as a probe [Maniatis 
et al.. 

Molecular Cloning: A Laboratory Model , Cold Spring Laboratory. Ojld Spring Hartxw. N.Y.i (1982)|. The blot 
was hybridized with 5 x 10^ cpm of the probe in 5 ml of a solution containing 50% formamide, 5X SSC, 50 

40 mM sodium phosphate. pH 7.0, 250 ug/ml denatured salmon sperm DNA, and IX Denhardfs solution at 
42* for 18 h. The filter was washed in 0.1% sodium dodecylsulfate (SDS). 2X SSC at room temperature 3 
times, each time 5 min.. and in 0.1% SDS, 0.2 X SSC at 50* twice, each 5 min. The nitrocellulose paper 
was then air dried, autoradiographed for 3 days at -70* using Kodak XAR-5 film and intensifying screen. 
Other recombinant DNA methods. Preparation of cloned Agtll DNA subcloning in pUC19 plasmid and 

45 Ml3mpia vector, generation of deletion by exonuctease 111 digestion arKj DNA sequencing by dideoxy 
method (Sanger et al., Proc. Natl. Acad. Sci. USA 83 , 6776-6780 (1977)), were performed as described by 
Wun and Kretzmer. supra . 

The program FASTP written by Upman and Pearson. Science 227 1435-1441 (1985), was used to 
identify homologous families of proteins from National Biomedical Research Foundation Sequence Data 

50 Bank (release 13. June 1987) and to align the sequences within the homotogous family. 

RESULTS 

Screening of cDNA libraries 

65 

A number of cell lines were screened for the presence of TFI in th conditioned media and it was found 
that several liver-derived cell lines, Chang liver. H©pG2 hepatoma, and SK-HEP-1 hepatoma seer te TFI in 
culture. Initially, an antiserum against TFI was used to screen a human fetal liver xgtll cDNA library (10^ 
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plaque forming unrts), and 15 immunotogically positive clones wer obtained. Subsequently, the same 
method was used to screen a placental Xgtll cDNA library. Out of 10^ plaque forming units. 10 
immunologically positive clones were obtained. These clones were plaque purified and the lysates of th 
purified clon s w re t sted for the functional activity of TFI. Ih isopropylthiogaiactosid induced phag 

5 lysates were absorbed on the nitrocellulos paper and sere n d for the *^l-Factor Xa binding activity. 
Figure 1 demonstrates that some of these immunologically positive clones showed the ability to bind the 
^^hFactor on the nitrocellulose paper. In all. 3 out of 15 immunoiogicaily positive fetal liver clones, and 4 
out of 10 immunologically positive placental clones showed ^=^l-Factor Xo binding activity. These im- 
munologically and functionally positive clones were digested with EcoRI and the size of the Inserts were 

70 estimated by gel electrophoresis. One clone from placental library (XP9) had an insert of approximately 1 .4 
kb. while all the other clones contain inserts of approximately 1.0 kb. Partial DNA sequencing has showed 
that 1.0 kb clones contain sequences identical to part of the longer 1.4 kb placental clone (XP9}. The XP9 
was therefore selected for complete sequencing. 

15 Nucleotide sequence and predicted protein sequence of TFI cDNA isolate 

The XP9 clone was subjected to restriction mapping. Ml 3 sut)Ctoning and sequencing by the strategy . 
showed in Rgure 2. The entire sequence was determined on both strands by the exonuclease III deletion 
method [Henikoff. Gene 28 , 351-359 (1984)] and found to consist of 1432 bases in length. The sequence is 

20 shown in Rgure 3. It contains a 5'-noncoding region of 133 bases, an open reading frame of 912 
nucleotides, and a 3'-noncoding region of 387 nucleotides. The first ATG occurs at nucleotide 134 in the 
sequence TAGATGA which was closely followed by a second ATG at nucleotide 146 in the sequence 
ACAATGA. These are possibly the initiation sequences, although they differ from the proposed consensus 
sequence for initiation by eukaryotic ribosome, ACCATGG [Kozak, Cell 44 . 283-292 (1986)]. Twenty-eight 

25 amino acids precede a sequence corresponding to the N-termlnal of the mature protein. The length and 
composition of the hydrophobk: segment of these 28 amino acids are typical of signal regions [Von Heijne. 
Eur. J. Biochem. 133 . 17-21 (1983); 

J. Mol. Biol. 184 . 99-105 (1985)]. A signal peptidase possibly cleaves at Ala28-A$p29 to give rise to a 
mature protein. The sequence predicted for mature TFI consists of 276 amino acids that contains 18 

30 cysteine residues and 7 methionines. The calculated mass of 31.950 Dattons based on the deduced protein 
sequence for mature TFI is somewhat lower than the 37-40 kDa estimated by sodium dodecyl sulfate 
poiyacrylamide gei electrophoresis of isolated protein, and the difference probably reflects the contribution 
of glycosylation to the nrK>bility of ttie natural protein. The deduced protein sequence corresponding to the 
mature protein contains 3 potential N-linked glycosylation sites witii the sequence Asn-X-ThrTSer (amino 

35 add positions 145. 195. and 256). Amino acid sequence analysis of purified whole TFI and two isolated 
proteolytic fragments match exactly the protein sequence deduced from cDNA sequence (Figure 3, 
underlined), indicating the isolated cDNA clone encodes TFI. The S'-noncoding region is A-^T rich (70% 
A* J). Neither consensus polyadenylation signal, AATAAA [Proudfoot and Brownlee, Nature 252 . 359-362 
(1981)] nor the poly A tall was found in this clone, possibly due to artefactual loss of part of 3* terminal 

40 portion during construction of the library. 

Charge distribution, hydrophobicity/hydrophilicity, and internal homology 

The translated amino acid sequence of the TFI contains 27 lysines. 17 arginines. 11 asparUc acids, and. 
45 25 glutamic acids. The charge distribution along the protein is highly uneven as shown in Rgure 4. The 
signal peptide region contains 2 positively charged lysine with 26 neutral residues. The ami no-terminal 
region of the mature protein contains a highly negatively charged stretch. Six of the first 7 residues are 
either aspartic acid or glutamic acid which are followed closely by two more negatively charged amino 
acids downstream before a positively charged lysine residue appears. The center portion of the molecule is 
50 generally negatively charged. At the cartx>xy terminal, there is a highly positively charged segment. Tha 
amino acids 265 to 293 of TR contain 14 positively charged amino acids including a 6-consecutive arginine 
lysine residues. 

Ttte predicted hydrophilicity/hydrophobicity profile of TFI protein is shown in Rgure 5. The signal 
peptide contains a highly hydrophobic region as expected. The rest of the molecule appears rather 
55 hydrophilic. 

The translated amirK> add sequence of TFI contains several discernible domains. Besides the highly 
negatively charged N-terminal domain and the highly positively charged C-terminal domain, the center 
portion consists of 3 homologous domains which have the typical sequences of th Kunitz-type inhibitors 
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(see below). 

Homology to other proteins 

s By searching the National Biomedical R s arch Foundation sequence data base, it was found that the 
N-terminal domain and C-terminal domain of TFI do not show significant homology to other known proteins. 
The 3 internal homologous domains, however, are each homologous to the sequences of other k>asic 
protease inhibitors including bovine pancreatic basic protease inhibitor (aprotinin). venom basic protease 
inhibitors, and inter-o-trypsin inhibitors (Figure 6). It is noteworthy that disulfide bonding structure is highly 

10 consented in ail these inhibitors. Based on these homologies. H is clear that TFI belongs to the basic 
protease inhibitor gene superfamily. 

Northem blotting 

f6 Poly(A)'t' RNAs were purified from TFi-producing liver-derived cell lines. Chang liver. HepG2 hepatoma, 
and SK-HEP-1 hepatoma cells. The poly (A)<f RNAs were resolved by denaturing agarose gel elec* 
trophoresis. transblotted onto a nitrocellulose paper and probed ¥vith ^P-labeled TFI cONA (XP9). As shown 
in Figure 7. two major l>ands of hybridization were ot>served that corresponded to mRNAs of 1.4 kb and 4.4 
kb in all three cell lines tested. Several other cell lines were tested which do not produce detectable 

20 amounts of TFI and in which no hybridization with the probe was found, (data not shown). 

Claims 

1. cDNA sequence coding for mature human tissue factor inhibitor protein having the nucleotide sequence 
25 as shown in FIG. 3 of the drawings. 

2. The cDNA sequence of Claim 1 characterized in that the cDNA is the human tissue factor inhibitor 
cDNA clone XP9 characterized as shown in the restriction map in FIG. 2 of the drawings. 

30 3. The cDNA sequence of any of Claims 1 or 2 encoding a protein having a mass of 37-40 kDattons as 
estimated by sodium dodecyl sulfate polyacrylamide gel electrophoresis and having the amino acid 
sequence as shown In FIG. 3 of the drawings. 

4. The cDNA sequence of any of Claims 1 or 2 encoding a protein which is the methionyl tissue factor 
35 inhibitor protein containing the amino acid sequence as shown in FIQ. 3 of the drawings. 

Patenia nspril che 

1. cDNA-Sequenz. die fOr ein reifes Humangewebsfaktorinhibitorprotein kodiert. mit der in Rg. 3 der 
40 Zetchnungen gezeigten Nucleotidsequenz. 

2. cDNA-Sequenz nach Anspruch 1 . dadurch gekennzeichnet. dafi die cDNA der Humangewet>sfaktorinhi- 
bitor-cDNA-Klon XP9 ist. gekennzeichnet wie in der Restriktionskarte der Rg. 2 der Zeichnungen 
gezeigt 

45 

3^ cDNA-Sequenz nach einem der Ansprtiche 1 Oder 2. welche fUr ein Protein mit einer Masse von 37-40 
kDatton kodiert. wie durch Natriumdodecylsutfat-Polyacrylamidgeielektrophorese bestimmt und mit der 
in Rg. 3 der Zeichnungen gezeigten AminosSuresequenz. 

50 4. cDNA-Sequenz nach einem der AnsprOche 1 Oder 2. welche fQr ein Protein kodiert das das 
Methtonylgewebsffaktorinhtbftorprotein ist, welches die in Rg. 3 der Zeichnungen gezeigte AminosSure- 
sequenz enthSIt 

Revendicatlons 

55 

1. Sequence d*ADNc codant une protein mature qui est un inhibiteur du facteur ttssulaire humain de la 
coagulation, et prdsentant la sequence de nucleotides indiquie sur la figure 3 des dessins annexes. 
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2. Sequence d*ADNc conform & la r v ndication 1. caract^ris^ en ce que cet ADNc est le clone XP9 
d*ADNc d'tnhibiteur du facteur tissulaire humain de la coagulation, qui present les caract^ristiques 
indiqu^s dans la carte de restriction representee sur la figur 2 d s d ssins annexes. 

5 & sequence d*ADNc conforme & l*une des r vendications 1 et 2, codant une proteina-dont la masse, 
estimee d'apr&s une eiectrophorfese en gel de polyacrylamtde-dodecylsulfate de sodium, vaut de 37 ^ 
40 kilodaltons et dont la sequence d'acides amines est celle qui est representee sur la figure 3 des 
dessins annexes. 

10 4. sequence d'ADNc conforme h Tune des revendications 1 et 2. codant une proteine qui est rinhtbiteur 
methionyie du facteur tissulaire humatn de la coagulation et qui contient la sequence d*acides amines 
representee sur la figure 3 des dessins annexes. 
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